Study of K-NN Evaluation for Text Categorization using Multiple Level Learning
نویسندگان
چکیده
منابع مشابه
Study of K-NN Evaluation for Text Categorization using Multiple Level Learning
Predefined category exists for text categorization. In a document, text may be of any type category like government, education or health etc. many methods exist in market invented by researchers for text categorization. One of them is k-NN (k nearest neighbor) algorithm. k play a role to define number of classes for categorization. A training set is generated for each type of category to check ...
متن کاملLearning Context for Text Categorization
This paper describes our work which is based on discovering context for text document categorization. The document categorization approach is derived from a combination of a learning paradigm known as relation extraction and an technique known as context discovery. We demonstrate the effectiveness of our categorization approach using reuters 21578 dataset and synthetic real world data from spor...
متن کاملArabic Text Categorization using Machine Learning Approaches
Arabic Text categorization is considered one of the severe problems in classification using machine learning algorithms. Achieving high accuracy in Arabic text categorization depends on the preprocessing techniques used to prepare the data set. Thus, in this paper, an investigation of the impact of the preprocessing methods concerning the performance of three machine learning algorithms, namely...
متن کاملImproved Graph Based K-NN Text Classification
This paper presents an improved graph based k-nn algorithm for text classification. Most of the organization are facing problem of large amount of unorganized data. Most of the existing text classification techniques are based on vector space model which ignores the structural information of the document which is the word order or the co-occurrences of the terms or words. In this paper we have ...
متن کاملLearning-Free Text Categorization
In this paper, we report on the fusion of simple retrieval strategies with thesaural resources in order to perform large-scale text categorization tasks. Unlike most related systems, which rely on training data in order to infer text-to-concept relationships, our approach can be applied with any controlled vocabulary and does not use any training data. The first classification module uses a tra...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Applications
سال: 2015
ISSN: 0975-8887
DOI: 10.5120/21855-5151